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Abstract 

To factor an integer N, given that it is equal to the product of two primes p and q, it suffices to 
find an integer d < satisfying the test "C(2\/ Nd) 2 — ANd is a perfect square", where C denotes 
the integer ceiling function. In this approach, the factorization problem equates to the problem of 
designing an optimal data base D of values d to be tested. 
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1. THE BASIC TEST 

If x is a positive real number, let C{x) denote the smallest integer greater than or equal to x (the 
integer ceiling function). Let the function f{x) be defined by the formula f(x) = C{2^/x) 2 — Ax and let 
f 2 (x) denote the composition /(/(x)) . 

PROPOSITION 1.1. Let n be a positive integer. Then f(n) = if and only if n is a perfect square 
( \/n is a whole number). 

PROOF. If f(n) = 0, then C(2y / n) 2 = An. Thus, C{2y/n) = 2y / nis an even integer. Therefore, ^fn 
is an integer. Conversely, if y/n is an integer, then so is 2\fn . Thus, C(2y / n) = 2-^/n, which means that 
C(2^/E) 2 = An and f(n) = . 

PROPOSITION 1.2. Let n be a positive integer. Then f 2 (n) = if and only if there exist positive 
integers u and v such that n = uv and | ^/u — y^l < 1 . 

PROOF. Suppose that / 2 (n) = . By Proposition 1.1, f(n) = t 2 for some positive integer t . Thus, 
An = C(2y/n) 2 - t 2 . Note that C(2y/n) and t must be both even or both odd. Therefore, if we let 
u = ^(C(2y / n) + t) and v = ^(C(2^/n) — t) , then both u and v are integers and n = uv . To show that 
\t/u — y/v\ < 1 , observe that u + v = (7(2-^) , so that u + v — 2^pwv = C(2^fn) — 2y / n < 1 . That is, 
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\y/u — yjv\ 2 < 1 , which implies \^fu — y/v\ < 1 . Conversely, suppose n = uv where \y/u — y/v\ < 1 . By 
Proposition 1.1, to show that f 2 (n) = , it suffices to show that f(n) = t 2 for some integer t . If u = v , 
then n is a perfect square and there is nothing to prove. So we may assume that u/d. Let t = \u — v\ . 
Then t 2 = (u + v) 2 — Auv . Since \y/u — y/v\ < 1 , it follows that u — 2^/uv + v < 1 . Therefore, since 
u/t), we have 2^/uv < u + v < 2^pwv + 1 , so that u + v = C(2^/n) and t 2 = C(2y / n) 2 — An = f(n) . 
Thus, f 2 {n) = . 

COROLLARY 1.3. Suppose N is the product of two primes p and q where \y/p — y/q\ < 1 . Then 
the prime factors can be recovered explicitly in terms of N by way of the formulas: p = ^(C(2VN) + 1) 
and q = \(C{2yfN) - t) , where t = ^ C(2*jN f - AN . 

EXAMPLE 1.4. Take N = 176039 . Then t = ^C{2*/N) 2 - AN = 38 , p= \{C{2y/N) + t) = 439 
and q = |(C(2\/]V) -t) = 401 . Thus, the factorization N = (439) (401) follows instantly from the fact 
that VA39 - v/401 < 1 . 

PROPOSITION 1.5. To factor an integer N , given that it is equal to the product of two primes 
p and q , it suffices to find an integer d < \N satisfying the test f 2 (Nd) = . Then Nd = uv , where 
u = ±{C{2^fNd) +t), v= ±{C{2^fNd) - t) and t = \J C (2y/Nd) 2 — ANd . The prime factors p and q 
can be recovered separately as factors of u and v through the formulas p = gcd(iV, u) and q = gcd(iV, v) . 

PROOF. As in Proposition 1.2, with n = Nd, we have f 2 (Nd) = 0. Thus, Nd factors as Nd = uv 
where u = \{C{2y/Nd) + t) , v = \{C{2yfNd) - t) and t = ^C{2y/Nd) 2 - ANd. It remains to show 
that p and q are factors of u and v separately. A rough estimate is enough to prove this. Since, by 
Proposition 1.2, yju— y/v = 5 < 1 , we see v > u — 2y/u, so that, at least whenever u > 16, we have 
Nd = uv > u 2 — 2u2 > in 2 . If p and q both were factors of u , we would have u = apq = aN for 
some integer a which would imply that Nd > ^a 2 N 2 or d > ^a 2 N , contradicting the assumption that 
d < ^N . A similar contradiction occurs if we suppose that p and q both divide v . 

EXAMPLE 1.6. Take N = 1110757 and d = 170. Then f 2 (Nd) =0. Employing the formulas in 
Proposition 1.5, we get t = 23 , u = 13753 and v = 13730 . Thus N = pq , where p = gcd(iV, u) = 1373 , 
q = gcd(iV, v) = 809 . 

PROPOSITION 1.7. Suppose N = pq where p and q are distinct primes. An integer d < |iV 
satisfies the test f 2 (Nd) = if and only if d factors as d = xy where | ^/py — y/qx\ < 1 . 

PROOF. Suppose d satisfies the test f 2 (Nd) = 0. Then, as in Proposition 1.2, Nd = uv where 
\y/u — y/v\ < 1. By Proposition 1.5, we may assume that a is a multiple of p, say u = py , and v 
is a multiple of q, say v = qx . Thus, d = xy and \yfpy — y/qx\ < 1 • Conversely, suppose d = xy is 
found to satisfy the inequality \y/py — y/qx\ < 1 • Then, by Proposition 1.2, Nd = pyqx satisfies the test 
f 2 (Nd) = 0. 

2. A FACTORIZATION STRATEGY 

Given N = pq where the distinct prime factors p and q are unknown, our objective is to find an 
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integer d to satisfy the test f 2 (Nd) = . By Proposition 1.7, the test f 2 {Nd) = is successful if any 
one of the factorizations of d as d = xy satisfies the inequality \-y/py — y/qx\ < 1 • Thus, for a test value 
d , the likelihood of success increases as r(d) , the number of divisors of d , increases. To formalize this 
observation, we introduce a new function defined on finite sets of integers called the "yield function" . 

DEFINITION 2.1. Let d be a positive integer. The yield of d, denoted Y(d) , is the number of dis- 
tinct fractions < | < 1 in lowest terms such that xyz 2 = d for some integer z . If S = {d\, cfe, . . . , dk} 
is a set of test values, then the yield of S , denoted Y(S) , is the number of distinct fractions < | < 1 
in lowest terms such that xyz 2 G S for some integer z . 

EXAMPLE 2.2. Let d = 12 . The set of distinct fractions < | < 1 such that xyz 2 = 12 is 
{j2, |, |} . Note that the fraction | corresponds to the factorization 12 = (2) (6) = (1)(3)(2 2 ) . Thus 
Y(12) =3. 

EXAMPLE 2.3. Let S = {5, 12, 20} . The set of distinct fractions < | < 1 such that xyz 2 G S is 

jL, |, |, | , |} . Thus y(5) = 6 . Note that the factorization 5 = (1)(5) contributes nothing to the 
yield of S in view of the factorization 20 = (2) (10) . In fact, the yield of S is the same as the yield of 
the subset S' = {12,20} . 

Now, let D denote a finite data base of test values, say D = [d\, d2, ■ ■ ■ ,d m ] , structured as a list 
of integers in ascending order. By definition, the cost of factoring the integer N relative to T> is the 
number of values of d G T> which need to be tested before a successful value (satisfying f 2 (Nd) = ) is 
found. Obviously, our main objective is to construct a data base which minimizes the cost of factoring 
any given N of the form N = pq . Intuitively, at least, it appears fairly evident that some data bases 
will be more effective than others. Qualitatively speaking, the cost of factoring N relative to D should 
decrease as the yield Y(T>) increases. 

At this stage, we will attempt to supply at least a rough estimate of the effectiveness of a data 
base. Suppose N = pq , where p < q . In D we want to find d = xy such that yfpy — ^fqx < 1 . This is 
equivalent to 




This inequality is satisfied if the set of fractions | in the interval [0, 1] is so numerous that, for at least 
one of them, comes within a distance of ^7= of the fixed quantity . This will have a high 

probability of happening if 

Y(T>) > y/qy. (2) 

At this point, we need to formulate a reasonable estimate for y/qy . However, without any specific 
knowledge of the structure of T) this is difficult to do. We turn then to a discussion of some specific 
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choices of data base D . 

3. SOME SPECIAL DATA BASES 

The simplest data base is a list of consecutive integers starting at 1. Let T>o(m) = [1,2,3, . . . ,m] . 
To refine the inequality (2), note that y < m . However, the median divisor of a typical d G D is yd, 
the maximum of which is \Jm . It follows that \fm is a good candidate to represent y in the inequality 
Y(D) > y/qy . Thus, for the data base CDo(m) , (2) becomes 

F(DoM) > y/qtfU. (3) 

The explicit dependence on q can be removed by setting i? = | > 1 , so that ^fq = \f~R\f~N . Then (3) 
becomes 

y(D (m)) > y/mNR. (4) 

A sharp lower bound estimate of Y(T>o(m)) is given by m itself. To see this, note that X^ g 2) (m) Y(d) 
is bounded above by Xa<fc<m r (^) = 0(n\a.(n)) Thus, the data base Do(m) has a good chance of 
factoring N if m > mNR . Equivalently, T>o(m) has a good chance of factoring iV if iV < ^ . Often, 
a successful value of d G I>o(?7i) is found well before the entire data base is exhausted. This is borne out 
by extensive numerical experiments using MAPLE. 

A more interesting type of data base consists of the set of divisors of a given integer B . Let 
T>i(B) = [d\,d2, ■■■,d m ], where the dj are the divisors of B arranged in increasing order. Thus, 
m = t(B) and the largest element in Di(i?) is d m = B . Let B = p^p^pjj 3 ■■■Vk (P rmie power factor- 
ization), then m=(n + l)(r 2 + 1) ... (r k + 1) and Yy{B) = Y{T>i(B)) = (2n + l)(2r 2 + 1) ... (2r k + 1) . 
Thus, a good likelihood exists of factoring N provided that Y X {B) > \/\N) \f\R) \f{B) . 

Some interesting choices for B (evidenced by extensive numerical experiments using MAPLE): 
B = n\ 

B = (2) (3) (5) ... (pk) (product of first k primes). 
B = lcm(l,2,3, m) (1cm of first m integers). 
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